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Executive Summary 

This study examines the relationship between students' SAT® essay scores and college 
outcomes, including first-year grade point average (FYGPA) and first-year English course 
grade average (FY EngGPA), overall and by various demographic and academic performance 
subgroups. Results showed that the SAT essay score has a positive relationship with 
both FYGPA and FY EngGPA. Additionally, the SAT essay score offers useful and unique 
information in understanding how students perform in college generally, and in their English 
course work, after controlling for relevant academic variables (e.g., high school grades and 
SAT scores). As we continually seek to improve students' writing performance in the United 
States, the findings warrant further exploration of the value and role of the SAT essay in 
college admission and placement, particularly because students who take the SAT and apply 
to college are already providing this information to admission offices. 
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Introduction 

The SAT writing section, which was added to the SAT in March 2005, includes both multiple- 
choice items and an essay. The multiple-choice items assess the ability to identify sentence 
errors and improve grammar and usage, and students have 35 minutes to answer 49 multiple- 
choice questions. The essay tests a student's ability to write effectively in response to an 
issue provided in a prompt, and students have 25 minutes to formulate their response. 

The SAT writing section is scored on a 200-800 scale, with the multiple-choice items 
accounting for approximately 70% of a student's writing score and the essay accounting for 
the remaining 30%. The essay is scored holistically by two independent readers on a scale 
of 1 to 6. If the two readers' scores differ by more than one point, a third reader scores the 
essay. Approximately 1 % of all scored essays require a third reader (College Board, n.d.).The 
combined score for both readers ranges from 2 to 12, and essays that are not written on the 
essay assignment receive a score of 0. Official SAT score reports that are sent on behalf of 
applicants to colleges and universities include the essay subscore. 

There has been a great deal of validity research on the SAT writing section as a whole since 
its introduction in 2005 (e.g., Kobrin, Patterson, Shaw, Mattern, & Barbuti, 2008; Mattern, 
Patterson, Shaw, Kobrin, & Barbuti, 2008; Patterson, Mattern, & Kobrin, 2009; Patterson & 
Mattern, 2010); however, there has been far less research focused on the essay component 
alone, and particularly on the relationship between the SAT essay score and college 
outcomes. Early research on the SAT essay (e.g., Kobrin, Deng, & Shaw, 2007) focused 
on examining data to study media criticisms of the SAT which claimed, for example, that 
the essay score merely reflected the number of words the student wrote or the nature of 
the examples provided. Kobrin, Deng, and Shaw (2011) examined the relationship between 
features of student responses on the essays and the scores they received. Their results 
indicated that essay length was related to scores, but the correlation was not nearly as 
high as previous critics have claimed. After controlling for SAT critical reading and writing 
multiple-choice scores, the essay features with the largest positive effect sizes included, 
using a five-paragraph theme and using academic evidence, although these effect sizes are 
still considered small. The features with the largest negative effect sizes included using no 
evidence or support and ending the essay mid-sentence. 

Many educators have advocated for essay assessments (also often referred to as direct 
writing assessments) over indirect writing assessments such as multiple-choice questions. 
Direct writing assessments are viewed as more authentic and as having more positive 
"washback" effects than indirect writing assessments. These washback effects are similar to 
the notion of consequential validity, pertaining to the social consequences of using a particular 
test for a particular purpose (Messick, 1989; Noeth & Kobrin, 2007). That is, arguments have 
been made for more essay testing because traditional multiple-choice measures of writing 
skills are viewed as having the unintended side effect, or negative consequence, of causing 
teachers to teach and students to focus on sentence-level problems while ignoring the more 
global aspects of writing (Breland, Bridgeman, & Fowles, 1999). 

However, the greater authenticity of direct writing assessment is usually offset by the lower 
reliability of scores compared to indirect writing assessments. Essay assessments have lower 
reliability than multiple-choice tests due to issues related to the number and selection of 
tasks, and issues related to scoring. When the predictive value of a test is measured by the 
correlation of students' scores on the test with college outcome measures such as first-year 
grade point average and grades in college courses, the indirect writing assessments will 
always prevail. 
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Nevertheless, research has shown the value of essay tests as predictors of college 
performance, especially when they are used in conjunction with multiple-choice writing 
test scores. For example, Breland (1983) cited five studies providing useful evidence for 
the incremental validity of direct writing assessments (essay tests) above high school GPA 
or rank, SAT scores, and other measures. In general, studies of writing tests for predicting 
college outcomes have suggested that both indirect and direct writing tests (essays) may 
be a slightly better predictor of English course grades than of more general outcomes such 
as GPA or instructors' ratings as criteria. Bridgeman and Lewis (1991 ) found that direct 
writing tests had greater predictive value in courses that were more likely to use essays in 
the determination of course grades, such as English or history courses rather than other 
outcomes. 

Examining writing performance as the outcome, Breland, Kubota, and Bonner (1999) studied 
the predictive validity of the SAT Subject Test in writing (the precursor to the SAT writing 
section) for predicting writing performance in college English courses for students entering 
eight different colleges in the fall of 1996. College writing performance was based on eight 
writing samples (drafts and final papers submitted in response to four take-home writing 
assignments) submitted as part of regular course work by approximately 300 students. The 
SAT Subject Test writing scores were also used to predict students' self-reported writing 
ability. The multiple-choice and essay components were both statistically significant predictors 
of both English course writing performance and students' self-reports; however, the multiple- 
choice score was a stronger predictor of writing performance. The authors noted that a 
combination of the multiple-choice and essay components provided a better prediction than 
either component used alone. 

Similarly, Matzen and Hoyt (2004) found that a timed essay test, in addition to a multiple 
choice writing assessment, was most valuable for writing course placement at an open 
admission, four-year public college in the Western United States. Students were misplaced 
into writing courses when relying solely on multiple-choice writing scores (about 30% were 
misplaced into lower level courses and about 9% were misplaced into higher level courses). 
Contrary to many other studies comparing direct and indirect writing assessments, Matzen 
and Hoyt found that if only one single measure could be used for placement, the timed 
essays were more accurate in placing students into English courses than the more indirect 
measures of writing. 

Subgroup differences on both direct and indirect writing tests are a persistent concern in 
measurement. Breland et al. (1999) reported that women tend to perform better than men 
do on both types of writing assessments (Willingham & Cole, 1997). Minority ethnic groups 
and groups for whom English is not their best language perform relatively less well on indirect 
tests of writing than they do on direct writing assessments (Pomplun, Wright, Oleka, & 
Sudlow, 1992). To the extent that essay assessments are less reliable than multiple-choice 
tests, they would be expected to show smaller group differences. 

The purpose of this study was to more deeply understand SAT essay performance by various 
subgroups and demonstrate the utility of the SAT essay, a direct writing assessment, as a 
predictor of college performance across students. A graphical or straightforward approach is 
used in this study to show what SAT essay scores add above HSGPA and composite (added 
together) SAT critical reading, math, and writing (multiple choice only) scores to the prediction 
of first-year college grades and first-year English course grades. 
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Data and Variables 

Sample 

The sample in this study was taken from the 110 four-year institutions that participated in the 
national SAT Validity Study and provided first-year performance data on the first-year, first- 
time students that entered college in the fall of 2007 (for more information on the sample, 
see Patterson et al., 2009). This sample of students (N = 216,081) was matched to the 2007 
College-Bound Seniors database, which included the SAT scores and SAT Questionnaire 
responses of students who reported that they would graduate from high school in 2007. In 
order to be included in the study sample, students had to have valid SAT scores, first-year 
grade point average (FYGPA), first-year English grade point average (FY EngGPA), retention 
to the second-year information, and a self-reported high school grade point average (HSGPA) 
from the SAT Questionnaire, resulting in a final sample size of 120,897 students. 

Measures 

Demographic information. Self-reported gender, ethnicity, best language spoken, first 
language spoken, and highest parental educational level were obtained from student 
responses to the SAT Questionnaire, which is completed during registration for the SAT. 

High school grade point average (HSGPA). Self-reported HSGPA was obtained from the 
SAT Questionnaire. This variable is on a 12-point scale with the following values: A+ (97-100; 
4.33); A (93-96; 4.00); A- (90-92; 3.67); B+ (87-89; 3.33); B (83-86; 3.00); B- (80-82; 2.67); 
C+ (77-79; 2.33); C (73-76; 2.00); C- (70-72; 1.67); D+ (67-69; 1.33); D (65-66; 1.00); E or F 
(Below 65; 0.00). 

First-year grade point average (FYGPA). Each participating institution supplied FYGPA 
values for the first-time, first-year students who entered in fall 2007. The range of FYGPAs 
across institutions was from 0.00 to 4.19. 

First-year English grade point average (FY EngGPA). Each participating institution supplied 
all first-year course grades for the entering class of fall 2007. Each course was coded for its 
specific content area for analysis and all English course grades that a student took in the first 
year were averaged to arrive at the FY EngGPA. When Shaw and Patterson (2010) examined 
first-year college course work in English across a national sample of four-year institutions, they 
found that the vast majority of English courses taken were composition courses (72%). This 
is likely because most institutions have writing course requirements in the first-year. For the 
current study, when a student took one English course in the first year, this one course grade 
served as the FY EngGPA. The range of FY EngGPA across institutions was from 0.00 to 4.33. 

Retention to the second year. In the fall of 2008, each participating institution supplied 
second-year retention information for students in the entering class of fall 2007. The retention 
indicator was dichotomously coded to indicate either "yes, the student returned" or "no, the 
student did not return." 

SAT scores. Official SAT scores were obtained from the 2007 College-Bound Seniors 
database for the analyses. This database is comprised of the students who participated in the 
SAT program and reported that they would graduate from high school in 2007. A student's 
most recent score was used in the analyses. The SAT is comprised of three sections: critical 
reading, math, and writing, and each section is scored on a 200-800 scale The SAT writing 
section has two separate scores for the essay and the multiple-choice items, and these 
scores are combined to form a total SAT writing score. 
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Method 

The analyses in this study included the calculation of means, standard deviations, and effect 
sizes or standardized differences using Cohen's d( 1988) for SAT essay scores by subgroups. 

In this study, Cohen's dis calculated by subtracting the total group SAT essay score mean 
from the subgroup SAT essay score mean and dividing that resulting value by the total group's 
SAT essay score standard deviation. In order to interpret Cohen's d, Cohen provided basic 
guidelines, characterizing an effect size of .2 as small, an effect size of .5 as medium, and an 
effect size of .8 as large. However, Cohen emphasized that when interpreting effect sizes, 
the researcher must contextualize the data and use personal judgment to assess the practical 
significance of an effect. 

Also, a graphical approach was incorporated to show what SAT essay scores add above 
HSGPA and composite SAT critical reading, math and writing multiple-choice scores in relation 
to college outcomes. This approach was advanced by Bridgeman, Pollack, and Burton (2004), 
and is often referred to as the straightforward approach because of its intuitiveness and 
simplicity as compared to the more traditional approach of showing incremental validity using 
multiple regression. As Bridgeman et al. noted, it is important to find ways to present the 
complicated concept of "variance explained" by different predictors in more straightforward 
ways for deeper understanding among broader audiences. In the current study, the academic 
performance groupings of High Achievers, Moderately High Achievers, Average Achievers, and 
Below Average Achievers (based on students' SAT critical reading, math, and writing multiple- 
choice scores and HSGPA) were based on those used by Bridgman et al., after confirming 
their face validity with regard to student performance standards. 


Results 

Descriptive Statistics 

Means, standard deviations, and effect sizes for essay scores by student subgroups are 
included in Table I.The overall mean essay score for the sample was 7.82 (SD = 1.49), 
indicating that this sample is more highly able than the typical population of college-bound 
seniors. In 2008, the first year that the essay subscores were reported in the annual College- 
Bound Seniors reports, the overall mean essay score was 7.10 with a standard deviation of 
1.70. Consistent with trends found in the population, the female students in the study sample 
tended to have higher mean essay scores ( M = 7.88, SD = 1.41, d= 0.04) than male students 
( M = 7.73, SD = 1.57, d = -0.06); however, the difference was quite small. With regard to 
racial/ethnic differences in mean essay score, Asian students had the highest mean scores 
(M = 8.13, SD = 1.56, d = 0.21), while black/African American students had the lowest mean 
scores (M = 7.15, SD= 1.44, d= -0.45). Students whose parents had received no high school 
diploma had a mean essay score of 7.16 (SD = 1.48, d = -0.45) and those with a parent who 
attended graduate school had a mean essay score of 8.21 (SD = 1.47 d = 0.26). 

Mean essay score was also reported for best language and first language spoken subgroups. 
Interestingly, these variables followed different trends from each other. There were larger 
differences in mean essay scores for the best language subgroups, with the English as Best 
Language subgroup having the highest mean (M = 7.84, SD = 1.48, d = 0.01), followed by the 
English and Another Language subgroup (M = 7.63, SD = 1.56, d = -0.13), and the Another 
Language subgroup (M = 7.06, SD = 1.63, c/ = -0.51). For the first language spoken subgroups, 
those in the English and Another Language subgroup had the highest mean essay score 
(M = 7.86, SD = 1.56, d = 0.03), followed by those in the English as Best Language subgroup 
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( M = 7.82, SD = 147, d = 0.00), and those in the Another Language subgroup (M = 7.71 , 

SD = 1.60, d= -0.07). 

It is useful to note that many of the standardized differences on the SAT essay by subgroup 
can be compared to standardized differences by subgroup in earlier research on the SAT 
critical reading (or SAT Verbal in research on the test administered before March 2005) 
and math sections. In particular, Kobrin, Sathy, and Shaw (2007) presented such subgroup 
differences on the SAT and other academic measures over a period of 20 years. Examining 
the differences in magnitude of the standardized differences on the essay by subgroup as 
compared to the other SAT sections, it is apparent that there tend to be smaller standardized 
differences on the SAT essay than the SAT critical reading (and math sections), although 
there were some exceptions. The standardized differences in SAT critical reading scores 
for Hispanics students ranged from -0.49 in 2003 to -0.41 in 2006, while the standardized 
difference on the SAT essay score for Hispanic students in this study was -0.26 in this study. 
For black/African American students, the standardized differences in SAT critical reading 
scores ranged from -0.72 in 1987 to -0.61 in 2006. However, the standardized difference in 
SAT essay score for black/African American students in this study was -0.45. An exception 
is that the standardized difference on the essay for Asian students in this study was 0.21 ; 
however, the standardized differences on SAT critical reading scores in Kobrin et al. tended 
to be somewhat smaller for this subgroup (ranging from -0.08 in 1996 to 0.06 in 2006), while 
the difference was typically much larger for SAT math (ranging from 0.45 in 1996 to 0.52 in 
2006). For students reporting their best language to be a language other than English, the 
standardized difference on the SAT essay (-0.51) was much smaller than that of SAT critical 
reading in the Kobrin et al. study (ranging from -1.05 in 1996 to -0.79 in 2006) but much 
larger than that of SAT math (ranging from 0.05 in 1996 to 0.12 in 2006), which requires less 
language proficiency. 

Table 2 shows the relationship between SAT essay score and academic performance 
subgroups. There were rather large differences in mean essay score by SAT performance 
subgroups, which were created based on students' composite or sum of SAT critical reading, 
math, and writing (multiple choice only) scores. Mean essay score followed the trend of 
increasing SAT performance with the strongest SAT performance subgroup having a mean 
essay score of 9.56 (SD = 1.34, d = 1.17) and the weakest SAT performance subgroup having 
a mean essay score of 6.04 (SD = 1.42, d = -1.19). Differences in mean essay score by HSGPA 
groups were not quite as pronounced as in the SAT performance subgroups, though the 
patterns of essay performance were the same. Students with a HSGPA above an A- had a 
mean SAT essay score of 8.06 (SD = 1.45, d = 0.16), whereas students with a HSGPA below 
B- had a mean SAT essay score of 6.82 (SD = 1.49, d = -0.67). Mean SAT essay performance 
was also examined by college performance subgroups, including FYGPA, FY EngGPA, and 
retention to the second year. Students with a FYGPA above a B+ had a mean SAT essay score 
of 8.28 (SD = 1.43, d = 0.31 ), while students in the lowest FYGPA subgroup of Below D+ had 
a mean SAT essay score of 6.79 (SD = 1.48, d = -0.69). For FY EngGPA, subgroups showed 
similar trends in mean SAT essay score to FYGPA subgroups, although the magnitude of 
the differences was smaller than for FYGPA. The group with a FY EngGPA above B+ had a 
mean SAT essay score of 8.10 (SD = 1.43, d = 0.19), while students in the lowest FY EngGPA 
subgroup of Below D+ had a mean SAT essay score of 6.97 (SD = 1.52, d = -0.57). With 
regard to retention to the second year, those students who did return had a mean SAT essay 
score of 7.87 (SD = 1.48, d = 0.03) versus a mean SAT essay score of 7.37 (SD = 1.50, 
d= -0.30) for students who did not return. 
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Figure 1 shows the relationship between SAT essay score and mean FYGPA (black line) and 
mean FY EngGPA (blue dotted line). This figure shows that as essay scores increase, both 
GPAs also increase, and the mean FY EngGPA tends to be slightly higher than the mean 
FYGPA at each essay score point, except for those receiving the lowest essay scores. 

Straightforward Approach 

Because the essay score scale is much smaller, and scores are less variable than scores on 
the other SAT sections, using a traditional regression approach to determine the contribution 
of essay scores to the prediction of college grades would most likely lead one to incorrectly 
surmise that essay scores do not provide any increment to the prediction. As will be shown 
in our analysis, this conclusion would be misinformed. The straightforward approach, or 
graphical presentation of the data, depicts a different story. 

Figure 2 shows the clear relationship between SAT essay scores and FYGPA, after essentially 
controlling for SAT scores and FISGPA. Table 3 includes the values that Figure 2 is based on. 
The table and figure group students by different achievement levels, based on their HSGPA 
and SAT critical reading + math + writing multiple choice scores, into: High Achievers, 
Moderately High Achievers, Average Achievers, and Below Average Achievers. While the 
trends are clear in the table, the graph more clearly shows the relationship of interest. With 
few exceptions (primarily due to smaller sample sizes), one can see that within all groups 
of similarly academically able students, as essay scores increase, so too does the mean 
FYGPA. The standardized differences in Table 3 show that within achievement levels, there are 
sometimes sizable differences in FYGPA by SAT essay score. For example, among Average 
Achievers, those with an essay score of 4 had a mean FYGPA of 2.23 ( SD = 0.79, d = -0.37), 
whereas those with an essay score of 10 had a mean FYGPA of 2.69 (SD = 0.60, d = 0.28). 

Not surprisingly, there were too few or no students with essay scores of 9, 10, 11, or 12 in the 
Below Average Achievers group to report on, just as there were too few High Achievers with 
an essay score of 2 or 3 to report on. Also, within each essay score point (e.g., all students 
with a score of 8, for example), trends across achievement level groups remained consistent 
with mean FYGPA, increasing steadily by achievement level. 

Figure 3 depicts the relationship between SAT essay scores and FY EngGPA, while grouping 
students by SAT scores and HSGPA, or controlling for academic ability. Table 4 includes 
the values that Figure 3 is based on. Similar to the FYGPA relationship, within all groups of 
similarly academically able students, as essay scores increase, so too does the mean FY 
EngGPA. The standardized differences in Table 4 indicate that within achievement levels, 
there are sometimes sizable differences in FY EngGPA by SAT essay score. The differences 
in FY EngGPA by SAT essay score appear to be most pronounced for the Average Achievers 
group. For example, among Average Achievers, those with an essay score of 4 had a mean 
FY EngGPA of 2.31 (SD = 1.08, d = -0.39), whereas those with an essay score of 10 had a 
mean FY EngGPA of 2.96 (SD = 0.75, d = 0.30). Also, within essay score, trends remained 
consistent across achievement level groups, with mean FY EngGPA increasing steadily by 
achievement level. The few disrupted trend patterns appear to be due to small sample size. 

Discussion 

The results of this study show that the SAT essay has a positive relationship with both 
FYGPA and FY EngGPA. This relationship remains apparent even when students are grouped 
by academic ability (HSGPA and SAT composite score, not including the essay), providing 
evidence for the predictive validity of the SAT essay with FYGPA and FY EngGPA. In other 
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words, when HSGPA and SAT scores are held constant, the essay still provides important 
information regarding college performance. 

With regard to subgroup differences on the SAT essay, consistent with prior research (Breland 
et al., 1999), female students tended to have higher essay scores than male students. 
Interestingly, however, Asian students in this sample outperformed other ethnic groups 
on the SAT essay and, with the exception of white students, outperformed other ethnic 
groups on the total SAT writing section in the 2007 College-Bound Seniors report (d = 0.17). 
This is unlike the historic performance of Asian students on the SAT Subject Test in writing 
(multiple choice and essay), the precursor to the SAT writing section on which Asian students 
performed about .3 of a standard deviation lower than the total group (Kobrin et al., 2007). 

This performance difference may be due to differences in the SAT Subject Test sample, as 
the students who took the SAT Subject Test in writing tended to be academically stronger 
students than general SAT takers because the SAT Subject Test in writing was used in 
admission to the University of California and other prestigious institutions. 

There are clear differences in essay performance when the question of whether a language 
other than English is a student's first language or best language is considered. There were 
larger differences in SAT essay means when the students' self-reported best language was 
considered, as opposed to first language. As best language is likely indicative of students' 
self-efficacy beliefs in the English domain (Bandura, 1993, 1997), it is not surprising that this 
is highly related to performance on the SAT essay. Additionally, SAT essay means increased 
as all academic measures increased (e.g., the FYGPA and EISGPA subgroups), establishing a 
clear, positive relationship between the SAT essay and academic performance in this study, 
including the more noncognitively based retention variable. 

The straightforward approach (Bridgeman et al., 2004) was used in this study because of the 
way it accessibly shows relationships between variables, requiring only an interpretation of 
trends in group means as opposed to more complicated statistics. Mattern, Kobrin, Patterson, 
Shaw, and Camara (2009) believe that this approach to explaining and showing relationships 
between variables such as the SAT and FYGPA may result in a better understanding and 
decreased misconceptions of research findings because the traditional multiple regression 
results providing measures of the "variance accounted for" are widely misinterpreted. By 
examining the relationship between SAT essay score and FYGPA or FY EngGPA, when 
students are grouped by academic ability, one can test whether the SAT essay is of additional 
assistance in predicting college performance above traditional measures such as HSGPA and 
SAT section scores. If the SAT essay did not help predict FYGPA or FY EngGPA, above these 
measures, one would see a straight line across all High Achievers (same mean FYGPA or 
mean FY EngGPA) and a straight line below for all Average Achievers, for example. This was 
not, however, the case. Students at each of the academic ability levels showed increases 
in mean FYGPA or FY EngGPA with increases in SAT essay score, except in a few cases, 
where it appears that the sample was quite small and therefore less stable (at the lower 
essay scores in this college-going sample). As a practical example, among all those students 
considered to be High Achievers, those with the lowest essay score that could be reported 
(4), had a FYGPA of 2.94, while those with the highest essay score (12) had a mean FYGPA 
of 3.50. This shows that by further grouping these high achieving students based on their 
performance on a 25-minute direct writing assessment, one will find differences in how these 
students will perform in college — evident in both their FYGPA and also specifically in their 
first-year English course work performance. 

There are a few limitations of the study that should be reported. As the study sample included 
somewhat higher-achieving, college-going students, there were fewer students with essay 
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scores at the lower end of the scale, which limits the generalizability and conclusions that can 
be drawn at that part of the essay score scale. Also, the academic ability categories used in 
the study were created based on those used in Bridgeman et al. (2004) after confirming their 
face validity with regard to student performance standards. Although we wouldn't expect the 
patterns in the straightforward approach graphs to considerably change if the academic ability 
categories were altered, it would be useful to confirm this in future research. 

Future research should explore the role of the SAT essay in college English or writing course 
placement decisions, given the relationship shown between essay score and English 
performance averaged across courses. It would also be useful to explore how the relationship 
between essay score and English grades might change based on type of English course 
examined, for example, a composition versus a literature course. Furthermore, research 
is planned to explore those students who may have discrepant performance on the essay 
versus multiple-choice items on the SAT writing section, and determine the additional 
information garnered for admission or placement by having the essay on the test for these 
students. This would be similar to a study by Shaw, Mattern, and Patterson (2011 ) that found 
that after controlling for relevant student characteristics and prior academic performance, 
an SAT critical reading-writing discrepancy still had an effect on first-year GPA as well as on 
English course grades in college. Also, the role of student's first versus best language on 
direct versus indirect writing assessments may provide useful information in understanding 
our increasingly linguistically diverse college applicants' writing abilities. This information may 
help to develop and target interventions to improve writing performance. 

Conclusion 

This study found that the SAT essay score provides useful and unique information in 
understanding how students will perform in college generally, and in their English course 
work, information that is independent of the other academic variables which we held 
constant. In addition, the SAT essay appears to demonstrate somewhat smaller demographic 
subgroup differences in performance than on the SAT critical reading and math sections. 

This information, taken together, indicates that the essay may be a currently underutilized, 
but valuable tool in college admission and placement. With so many students struggling with 
college-level writing (The National Commission on Writing in America's Schools and Colleges, 
2003), it would seem worthwhile to further explore this academic tool, particularly because 
students who take the SAT and apply to college are already providing this information to 
admission offices. 
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Table 1. 

The Relationship Between SAT Essay Score and Demographic Characteristics 




n 

M 

SD 

d 

Gender 

Female 

66,344 

7.88 

1.41 

0.04 

Male 

54,553 

7.73 

1.57 

-0.06 


American Indian 

623 

7.64 

1.46 

-0.12 


Asian 

10,393 

8.13 

1.56 

0.21 


Black/African American 

8,614 

7.15 

1.44 

-0.45 

Race/Ethnicity 

Hispanic 

9,637 

7.43 

1.50 

-0.26 


Other 

3,427 

7.94 

1.50 

0.08 


White 

82,850 

7.87 

1.45 

0.04 


No Response 

5,353 

8.02 

1.56 

0.13 


No High School Diploma 

2,602 

7.16 

1.48 

-0.45 


High School Diploma 

25,957 

7.36 

1.44 

-0.31 

Highest Parental 

Associate Degree 

7,965 

7.45 

1.39 

-0.25 

Education 

Bachelor's Degree 

39,758 

7.85 

1.44 

0.02 


Graduate Degree 

38,530 

8.21 

1.47 

0.26 


No Response 

6,085 

7.80 

1.54 

-0.01 


English 

111,520 

7.84 

1.48 

0.01 

Best Language 

English and Another 

6,488 

7.63 

1.56 

-0.13 

Spoken 

Another 

1,244 

7.06 

1.63 

-0.51 


No Response 

1,645 

7.70 

1.64 

-0.08 


English 

98,948 

7.82 

1.47 

0.00 

First Language 

English and Another 

13,568 

7.86 

1.56 

0.03 

Spoken 

Another 

6,812 

7.71 

1.60 

-0.07 


No Response 

1,569 

7.77 

1.63 

-0.03 

Total 


120,897 

7.82 

1.49 


Note: For all subgroups, standardized differences (d) are calculated as (Subgroup Mean minus Total Mean)/Total 
Standard Deviation. 
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Table 2. 

The Relationship Between SAT Essay Score and Academic Performance 



n 

M 

SD 

d 


5(2110-2400) 

5,062 

9.56 

1.34 

1.17 

SAT (Critical 

4(1810-2100) 

30,746 

8.65 

1.29 

0.56 

Reading + Math + 
Writing Multiple 

3(1510-1800) 

51,536 

7.77 

1.25 

-0.03 

Choice) 

2(1210-1500) 

29,222 

6.97 

1.27 

-0.57 


1 (600-1200) 

4,331 

6.04 

1.42 

-1.19 


Above A- 

72,355 

8.06 

1.45 

0.16 

HSGPA 

A- to B+ 

23,137 

7.66 

1.44 

-0.11 

B+ to B- 

21,976 

7.33 

1.45 

-0.33 


Below B- 

3,429 

6.82 

1.49 

-0.67 


Above B+ 

42,473 

8.28 

1.43 

0.31 

First-Year GPA 

B+ to C+ 

59,183 

7.71 

1.43 

-0.07 

C+ to D + 

16,144 

7.18 

1.45 

-0.43 


Below D+ 

3,097 

6.79 

1.48 

-0.69 


Above B+ 

56,600 

8.10 

1.43 

0.19 

First-Year Eng 

B+ to C+ 

46,272 

7.70 

1.45 

-0.08 

GPA 

C+ to D + 

12,989 

7.29 

1.53 

-0.36 


Below D+ 

5,036 

6.97 

1.52 

-0.57 

Retention to 

Yes 

107,950 

7.87 

1.48 

0.03 

Second Year 

No 

12,947 

7.37 

1.50 

-0.30 

Total 


120,897 

7.82 

1.49 


Note: For all subgroups, standardized differences (d) are calculated as (Subgroup Mean minus Total Mean)/Total 
Standard Deviation. 
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Mean FYGPA by SAT Essay Score, Grouped by Student Achievement Levels Based on HSGPA and SAT Scores (CR + M + W Multiple Choice) 
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Figure 1. 
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